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Large Scale Structure in the ELAIS SI Survey 
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ABSTRACT 

We present an analysis of the two-point angular correlation function of the ELAIS SI 
survey. The survey covers 4 deg 2 and contains 462 sources detected at 15pm to a 5a 
flux limit of 0.45 mJy. Using the 329 extragalactic sources not repeated in different 
observations, we detect a significant clustering signal; the resulting angular correlation 
function can be fitted by a exponential law w(9) = AO 1 ^ with A = 0.014 ±0.005 and 
7 = 2.04 ± 0.18. Assuming a redshift distribution of the objects, we invert Limber's 
equation and deduce a spatial correlation length ro = 4.3+q'7 h~ x Mpc. This is smaller 
than that obtained from optical surveys but it is in agreement with results from 
IRAS. This extends to higher redshift the observational evidence that infrared selected 
surveys show smaller correlation lengths (i.e. reduced clustering amplitudes) than 
optical surveys. 

Key words: galaxies: clusters: general - galaxies: evolution - infrared: galaxies - 
cosmology: observations - large-scale structure of Universe 



1 INTRODUCTION 

Theories of structure formation were strongly constrained 
by the statistical measurements of clustering in some of the 
early galaxy redshift surveys. Surveys of infrared galaxies, 
in particular, were a ble to rule out the then standard Cold 
Dark Matter Model (lEfstathiou et alJll99(j ISaunders et alJ 
11991ft. Present day redshift surveys such as the 2dFGRS 
l|Colless et, all 1200 j> . SDSS {York et alJboOCh and, in the 
fa r-infrared, the Point S ource Catalog Redshift survey, PSC- 
z ijSaunders et al.l|2000t) are now able to provide definitive 
measurements of the galaxy clustering in the local Universe. 

Despite this success, we have always known that galax- 
ies are biased tracers of the matter distribution and yet we 
have a poor observational or theoretical understanding of 
this bias, although it is assumed to be related to the process 
of galaxy formation and evolution. To understand bias and, 
by inference, galaxy formation, we need to better understand 
the clustering of different galaxy types and the evolution of 
this clustering with redshift. 

In this paper we attempt to provide an estimate of the 
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clustering of infrared galaxies a factor of ten deeper (in red- 
shift) than those seen in the IRAS surveys. To do this we 
provide the first estimate of clustering from any of the ex- 
tragalactic ISO surveys. This is thus the first estimate of 
clustering from galaxies selected at 15pm. We have used 
part of the ELAIS survey (Oliver et al. 2000) as this probes 
the largest volume of any of the ISO surveys. We measure 
the projected clustering by calculating the angular correla- 
tion function, we then discuss the constraints this places on 
the three dimensional clustering using Limber's equation. 



2 THE ELAIS SI SURVEY 

The European Large- Area ISO survey (ELAIS, Oliver et al. 
2000) was the largest Open Time programme on ISO. This 
project surveyed 12 square degrees, larger than all but the 
serendipitous surveys, making it ideal for clustering studies. 
The main survey bands were at 6.7, 15, 90 and 170 /im. Of 
these bands the 15/xm catalogues contain the greatest den- 
sity o f galaxies (see e.g. lGruppioni et al l2002USerieant et alJ 
2000), and provide the best statistics for clustering. The fi- 
nal analysis of the 15pm data using the Lari method for 
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Figure 1. Selection function used to generate the random sample of galaxies. The dark central region arises due to the deeper observations 
carried out in. 



one of the ELAI S fields (SI) has recently been completed 
llLari et alJl200ll) and this is the sample that we use in this 
analysis. 

The SI field is located at a(2000) = 00 h 34 m 44.4 s , 
5(2000) = -43°28'12", covering an area of 2 x 2 square 
degrees. The 15u.m survey is made from 9 raster observa- 
tions, each one of ~ 40' x 40'. The central raster Sl_5 has 
been observed three times. Using the Lari method we have 
obtained a sa mple of 462 sources t o 5a in the flux range 
0.45-150 mJy jGruppioni et alJl2002Tl . 



part of the survey, r it would have had a different signal-to- 
noise. We can define then a mask, Mi(r), which represents 
the detectability of each object as a function of position as 
follows 



Mi(r) 



if cfi{r) < <r„ 

1 if Oi(r) > a n 



(1) 



where a m in = 5. The un-normalised selection function 
can then be written as 



(2) 



which can be normalised 



3 SELECTION FUNCTION 

Besides the galaxy catalogue itself, the selection function is 
the most important ingredient in the calculation of cluster- 
ing statistics. Errors in the selection function will invalidate 
the answer, whereas errors in the weighting scheme will usu- 
ally make the answer more noisy. 

A selection function is required for each source list that 
is being investigated. The selection function, (f>, is defined 
as the expected number density of sources as a function 
of r (which might be two or three dimensional), in ab- 
sence of clustering; i.e., the expected number of galaxies 
dM in a volume dV is dM = cj)(r)dV. With this definition, 
J (f)(r)dV = N '. The selection function is used to simulate a 
catalogue with no clustering. 

To be selected from the ELAIS SI catalogue sources 
had to exceed a signal-to- noise threshold, a m i„. The signal 
to noise of a detected source i, is ai(n) = Si/N(n) where 
Si is the signal of the source and N(n) is the noise at the 
position of the source. Had this source been in a different 



jftdV 



(3) 



3.1 Building the masks 

In the full ELAIS SI region there are 9 independent noise 
maps N(r), corresponding to 9 independent sub-catalogues. 
Note that the central noise map is less noisy and the corre- 
sponding sub-catal ogue deeper, because t he ISO data were 
already combined IIGruppioni et al.N2002fl . We constructed 
a selection function as follows: for each source in the sub- 
catalogue we calculate the hypothetical signal-to-noise ratio 
(defined as the peak flux over the rms value) at each point 
in the raster. Where these exceed the extraction signal-to- 
noise threshold a m % n (equation^, the value of the selection 
function at that position is incremented (equation |5J. 

The 9 individual selection functions are then combined 
into a single one. Figure^shows the final image. In the over- 
lap region only one selection function was used and the final 
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catalogue excludes sources in that region that arose from 
the other sub-catalogues. S ources with stellar cou nterparts 
have also been removed (see lGruppioni et al.l2002T ) from the 
catalogue and excluded from the calculation of the selection 
function. We end up with a catalogue of 329 sources. 

The selection function so obtained is then used to gen- 
erate the random catalogues with no clustering, essential to 
properly calculate the two point correlation function. 



4 THE ANGULAR CORRELATION 
FUNCTION 

Correlation functions are widely used to study the distribu- 
tion of sources in surveys and to derive large scale properties 
of galaxies. The two-point spatial correlation function is de- 
fined so that 

dP = n 2 [l + Z(r)]dVidV 2 

is the joint probability of finding a source in a volume ele- 
ment dVi and another source in a volume element dVi- The 
function £(r) is the excess probability of finding an object 
compared to a random distribution of objects. 

Similarly, one can define the two point angular correla- 
tion function so that 

dP = n 2 [l + w(6)]dn 1 dn 2 

is the joint probability of finding a source in a solid angle 
element dfii and another source in a solid angle element 
dSl2- These two statistics are related by Limber's equation 
(Peebles 1980). 

In order to calculate the a ngular correlatio n function 
of mid-IR sources we use the lLandv fc Szalavl estimator 
jLandv fc Szalavlll993tl 



w{9) 



[DP] - 2 [PR] + [RR] 
[RR] 



(4) 



where [PP] , [RR] and [PR] represents the normalized num- 
ber of galaxy-galaxy, random-random and galaxy-random 
pairs with angular separation in (9, 9 + d9). 

Errors in the calculation of the angular correlation func- 
tion are dominated by Poisson noise. The erro r in each bin 
can be estimated using the following expresion feaugh et alJ 
Il996bl) : 



5w{6) = 2 



l + w{0) 
DD 



(5) 



where, in this case, DD is the total number of galaxy-galaxy 
pairs (not normalized) . Errors calculated using this equation 
are comparable to t he errors obtained from a boo tstrap re- 
sampling technique <Ling. Barrow, fc Frenklll986f) . 

A second source of errors comes from the to finite size 
of the sample. In order to correct for this effect we use the 
random sample to calculate the integral constraint as (e.g. 
Infante et al. 1994) 



Y,N rr {e) 



J2N rr (6)(l + w{9)) 



(6) 



and divide the calculated correlation function by this factor, 
S = 0.945. 

Figure |5| (top) shows the obtained angular correlation 
function, calculated using 200 realizations with 2000 random 
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Figure 2. Angular correlation function as calculated from ELAIS 
SI. Top figure shows the calculation performed using all sources, 
while in the bottom figure only those sources with fluxes larger 
than 0.7 mjy have been considered (excluding then, those faint 
sources only detectable in the central deeper raster Sl_5). Data 
points are fitted by a exponential law w(8) = A0 1 with A = 
0.014 ± 0.005 and 7 = 2.04 ± 0.18. 



sources each, in intervals of log# = 0.08 degrees. Random 
catalogues have been built using the selection function cal- 
culated in previous section. Pata points have been fitted by 
an exponential law of the form w(9) = A8 1 ' resulting in 
A = 0.014± 0.005 and 7 = 2.04±0.18 (where 9 is measured 
in degrees). 

The mean number of objects in each field is (n) = 35.25 
(excluding the central raster Sl_5, with 71 sources), with 
a standard deviation of 6 objects. Since Sl_5 field reaches 
deeper flux limits, it could in principle be subject to cluster- 
ing variations that would affect the whole clustering estima- 
tion. We repeat the above calculation removing those sources 
with fluxes fainter than the flux limit excluding Sl_5: a total 
of 27 sources with fluxes lower than 0.7 mjy are removed. 
By calculating again the selection function and the angu- 
lar correlation function we obtain then an estimate of the 
clustering from sources detectable over most of the SI field. 
In this case the integral constraint is slightly larger than 
the previously obtained, S = 0.970. The angular correlation 
function is shown in figure [5] (bottom) and is fitted by the 
same exponential law previously calculated. Although the 
correlation function is now slightly larger at scales ~ 0.1 deg, 
the errors are also larger and the overall correlation function 
is well fitted by the previous function. 
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Figure 3. Top: Redshift distribution of ELAIS objects obtained 
from followup spectroscopic observations and photometric red- 
shifts. Bottom: Model redshift distribution of ELAIS sources ob- 
tained by Pozzi et al. (2004). 



5 SPATIAL CORRELATION FUNCTION 

In case of small angles, both w and £ can be approximated 
by power law shapes, a nd the spatial correl ation function 
can be written as (e.g., iPhillipps et aljll97ctl 



(1 + *) 



-(3+€) 



(7) 



where ro is the comoving correlation length at z = and 
r the comoving distance. The parameter e is the cluster- 
ing evolution index and is interpreted as follows. A value 
e = corresponds to stable clustering in physical coordi- 
nates, i.e., galaxy clusters remain unchanged and clustering 
changes due to the expansion of the Universe, while e = 3 — 7 
corresponds to clustering fixed in comoving coordinates, i.e., 
clustering does not change with time and the galaxy clusters 
expand with the Universe. 

If w(8) is parametrized as w(8) = A w 6 f , t hen Lim- 
ber's equation becomes (e.g.. IPhillipps et al.lll97cl) 



A w — CYq 



/ D 1 e - 1 g- 1 (z)(l + z)- {3+e) (dN/dz) 2 dz 
(J{dN/dz)dz) 2 



(8) 



where Dg is the angular diameter distance, g{z) is the scale 
factor multiplied by the element of comoving distance 



9{Z) = 1T [{1 + 2)2(1 + ^) 1/2 1 



and 



C 



1/2 r[( 7 -i)/2] 
r(7/2) 



(9) 



(10) 



The only unknown quantity in equation |H| is the redshift 
distribution of the sources. A distribution given by 



dN 

dz 



2z'i 



-z exp 



3/2' 



where z m = \plz c is the median redshift of the survey, 
provides good fits to the distribution of optical galaxies 
( Baugbl ll996al . and references therein) and as been widely 
used to invert Limber's equation. However, the redshift dis- 
tribution of mid-IR sources is more extended and the equa- 
tion above is no longer valid. Figure |3] (top) shows the red- 
shift distribution of the ELAIS sources, obtained from op- 
tical spectroscopy (La Franca et al, 2004, in prep; Perez- 
Fournon et al, 2004, in pre p) and photometric redshifts 
iRowan- Robinson et aljf2003l) . Instead of the optical dN/dz 



dN 
dz 



oc z exp 



3/4 



(11) 



which fits reasonably well the distribution of ELAIS sources. 

We can now integrate equation |H1 in order to obtain ro. 
Assuming Qo = 1-0 and e — and using a median redshift 
of z m = 0.1 in equation 1111 . we obtain ro = 4.3 ft" 1 Mpc 
with a 95% confidence level in the range 3.8 ft" 1 Mpc to 
4.7 ft" 1 Mpc. 

An alternative redshift distribution of the ELAIS 
sources has been presented bv lPozzi et alJ , computed 

from the luminosity function fit of galaxies on ELAIS SI and 
S2 (see figure 0- bottom). When using this redshift distri- 
bution as input to equation |H1 we obtain similar results of 
ro = 4.2 ft" 1 Mpc with a 95% confidence level in the range 
3.6 ft" 1 Mpc to 4.8 ft" 1 Mpc. 



6 DISCUSSION 

We use a sample of 329 15^im galaxies detected in the ELAIS 
SI survey covering a region of 4 square degrees of sky to 
determine the angular correlation function of the galaxies. 
We measure w(9) up to scales of 1 degree, corresponding to 
~ 7 Mpc at the mean redshift of z = 0.14. The resulting cor- 
relation function is well fitted by a power law w(6) = A8 1 ' 
with A = 0.014 ± 0.005 and 7 = 2.04 ± 0.18. Assuming a 
redshift distribution we invert the angular correlation us- 
ing Limber's equation in order to determine the correlation 
length ro. We find a value of ro = 4.3 ft" 1 Mpc with a 95% 
confidence level in the range 3.8 ft" 1 Mpc to 4.7 ft" 1 Mpc. 

Table lists the correlation lengths obtained from sev- 
eral optical and infrared surveys. While optical surveys show 
a significant clustering with ro ~ 5 ft" 1 Mpc, IRAS sur- 
vey shows a lower value. The data described in this pa- 
per is consistent with this trend: that mid infrared selected 
sources show a smaller correlation length. This is expected 
because optical surveys favour elliptical galaxies which are 
more strongly clustered than spiral galaxies, while infrared 
surveys preferentially select spirals and star-forming galax- 
ies. We note that the ELAIS selected galaxies appear to have 
a marginally steeper two point correlation function than the 
optical and the IRAS surveys. By fixing 7 to the lower value 
allowed by the fit 7 = 1.86 we obtain ro = 4.11q'j, bringing 
the clustering amplitude closer to that seen by IRAS. 
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Table 1. Summary of correlation lengths values obtained from 
different surveys. 



Survey 



''0 



7 



Ref. 



APM 

SDSS 

2dFGRS 

IRAS 

PSCz 

ELAIS 



0.05 
0.18 
0.08 
0.02 
0.02 
0.14 



5.7 
5.7±0.2 
4.92±0.27 
3.79±0.14 
3.7 

4 cj+0.4 



1.67 
1.75±0.03 
1.71±0.06 
1.57±0.03 

1.69 
2.04±0.18 



References: 1 iMaddox et"ai]|l990t 2 IZehavi et al. 2002 (assum- 
ing a Einstein-de S itt er cosmology); 3 INorberg et al. l2001i 4 
ISaunders et al.lll992t 5 lJing et al]|2002l 



It is interest ing to note that the optical correlation fu nc- 
tion at z ~ 0.2 iZehavi et alJl200 St INorberg et al.ll200lf) is 
consistent with that at z ~ (Ma ddox et alll99fl) . i.e. there 
is no apparent evolution in the correlation function over this 
albeit small redshift range. Likewise the infrared galaxy cor- 
relation function estimate from this work at z ~ 0.14 is 
consistent with the infrared galaxy c orrelation functio n esti- 
mated from IRAS galaxies at z = Jjing et alJl2002t . This 
apparent lack of evolution may be because evolutionary ef- 
fects are small or that evolution in mass clustering is com- 
pensated by evolution in galaxy bias. It will be interesting 
to see if this apparent non-evolution in clustering of differ- 
ent galaxy population mixes is still found in deeper surveys 
(e.g. SWIRE, Lonsdale et al. 2003) as this might imply a 
striking conspiracy in the evolution of the bias in different 
galaxy types. 

By performing a systematic analysis of all density peaks 
in the redshift distribution of field galaxies, lElbaz fc Movl 
(2004) recently found an excess of ISO selected galaxies 
over the whole range of density peaks. This suggests than 
infrared galaxies are more strongly clustered than optical 
galaxies, in apparent contradiction to our results. Since the 
ISO-CAM surveys are deeper than ELAIS, it is plausible 
that there is an evolution of the clustering over the redshift 
interval spanned by these two ISO surveys. Infrared galaxies 
would then become more clustered towards higher redshift, 
while the clustering of optical galaxies changes little. This 
would agree qualitatively with hierarchical pictures of struc- 
ture formation. Such models predict that star formation and 
galaxy formation is driven by merger rate that would be a 
function of environment. Star formation would thus have 
occurred first and vigorously in the denser (more clustered) 
regions of the Universe, taking more time to initiate in lower 
densi ty (less clustered) regions (see also lElbaz fc Cesarskvl 
2003). The strong evolution in luminosity function of in- 
frared galaxies (e.g. Pozzi et al. 2004) might then be cou- 
pled with an evolution in their clustering. Optically selected 
galaxies sample regions where past star formation activity 
was high as well as those where current activity is high and 
are thus less sensitive to these effects. Finally this appar- 
ent contradiction may simple be that the deep ISO-CAM 
surveys do not sample a sufficiently large volume to be rep- 
resentative of the rest of the Universe. 
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